88 research outputs found

    Auxiliary Learning for Self-Supervised Video Representation via Similarity-based Knowledge Distillation

    Full text link
    Despite the outstanding success of self-supervised pretraining methods for video representation learning, they generalise poorly when the unlabeled dataset for pretraining is small or the domain difference between unlabelled data in source task (pretraining) and labeled data in target task (finetuning) is significant. To mitigate these issues, we propose a novel approach to complement self-supervised pretraining via an auxiliary pretraining phase, based on knowledge similarity distillation, auxSKD, for better generalisation with a significantly smaller amount of video data, e.g. Kinetics-100 rather than Kinetics-400. Our method deploys a teacher network that iteratively distills its knowledge to the student model by capturing the similarity information between segments of unlabelled video data. The student model meanwhile solves a pretext task by exploiting this prior knowledge. We also introduce a novel pretext task, Video Segment Pace Prediction or VSPP, which requires our model to predict the playback speed of a randomly selected segment of the input video to provide more reliable self-supervised representations. Our experimental results show superior results to the state of the art on both UCF101 and HMDB51 datasets when pretraining on K100 in apple-to-apple comparisons. Additionally, we show that our auxiliary pretraining, auxSKD, when added as an extra pretraining phase to recent state of the art self-supervised methods (i.e. VCOP, VideoPace, and RSPNet), improves their results on UCF101 and HMDB51. Our code is available at https://github.com/Plrbear/auxSKD

    PECoP: Parameter Efficient Continual Pretraining for Action Quality Assessment

    Full text link
    The limited availability of labelled data in Action Quality Assessment (AQA), has forced previous works to fine-tune their models pretrained on large-scale domain-general datasets. This common approach results in weak generalisation, particularly when there is a significant domain shift. We propose a novel, parameter efficient, continual pretraining framework, PECoP, to reduce such domain shift via an additional pretraining stage. In PECoP, we introduce 3D-Adapters, inserted into the pretrained model, to learn spatiotemporal, in-domain information via self-supervised learning where only the adapter modules' parameters are updated. We demonstrate PECoP's ability to enhance the performance of recent state-of-the-art methods (MUSDL, CoRe, and TSA) applied to AQA, leading to considerable improvements on benchmark datasets, JIGSAWS (↑6.0%\uparrow6.0\%), MTL-AQA (↑0.99%\uparrow0.99\%), and FineDiving (↑2.54%\uparrow2.54\%). We also present a new Parkinson's Disease dataset, PD4T, of real patients performing four various actions, where we surpass (↑3.56%\uparrow3.56\%) the state-of-the-art in comparison. Our code, pretrained models, and the PD4T dataset are available at https://github.com/Plrbear/PECoP.Comment: Accepted to WACV 2024 (preprint

    Comparison of Test Your Memory and Montreal Cognitive Assessment Measures in Parkinson’s Disease

    Get PDF
    Background. MoCA is widely used in Parkinson’s disease (PD) to assess cognition. The Test Your Memory (TYM) test is a cognitive screening tool that is self-administered. Objectives. We sought to determine (a) the optimal value of TYM to discriminate between PD patients with and without cognitive deficits on MoCA testing, (b) equivalent MoCA and TYM scores, and (c) interrater reliability in TYM testing. Methods. We assessed the discriminant ability of TYM and the equivalence between TYM and MoCA scores and measured the interrater reliability between three raters. Results. Of the 135 subjects that completed both tests, 55% had cognitive impairment according to MoCA. A MoCA score of 25 was equivalent to a TYM score of 43-44. The area under the receiver operator characteristic (ROC) curve for TYM to differentiate between PD-normal and PD-cognitive impairment was 0.82 (95% CI 0.75 to 0.89). The optimal cutoff to distinguish PD-cognitive impairment from PD-normal was ≤45 (sensitivity 90.5%, specificity 59%) thereby correctly classifying 76.3% of patients with PD-cognitive impairment. Interrater agreement was high (0.97) and TYM was completed in under 7 minutes (interquartile range 5.33 to 8.52 minutes). Conclusions. The TYM test is a useful and less resource intensive screening test for cognitive deficits in PD

    A Time Series Approach to Parkinson's Disease Classification from EEG

    Full text link
    Firstly, we present a novel representation for EEG data, a 7-variate series of band power coefficients, which enables the use of (previously inaccessible) time series classification methods. Specifically, we implement the multi-resolution representation-based time series classification method MrSQL. This is deployed on a challenging early-stage Parkinson's dataset that includes wakeful and sleep EEG. Initial results are promising with over 90% accuracy achieved on all EEG data types used. Secondly, we present a framework that enables high-importance data types and brain regions for classification to be identified. Using our framework, we find that, across different EEG data types, it is the Prefrontal brain region that has the most predictive power for the presence of Parkinson's Disease. This outperformance was statistically significant versus ten of the twelve other brain regions (not significant versus adjacent Left Frontal and Right Frontal regions). The Prefrontal region of the brain is important for higher-order cognitive processes and our results align with studies that have shown neural dysfunction in the prefrontal cortex in Parkinson's Disease

    Multimodal Classification of Parkinson's Disease in Home Environments with Resiliency to Missing Modalities

    Get PDF
    Parkinson’s disease (PD) is a chronic neurodegenerative condition that affects a patient’s everyday life. Authors have proposed that a machine learning and sensor-based approach that continuously monitors patients in naturalistic settings can provide constant evaluation of PD and objectively analyse its progression. In this paper, we make progress toward such PD evaluation by presenting a multimodal deep learning approach for discriminating between people with PD and without PD. Specifically, our proposed architecture, named MCPD-Net, uses two data modalities, acquired from vision and accelerometer sensors in a home environment to train variational autoencoder (VAE) models. These are modality-specific VAEs that predict effective representations of human movements to be fused and given to a classification module. During our end-to-end training, we minimise the difference between the latent spaces corresponding to the two data modalities. This makes our method capable of dealing with missing modalities during inference. We show that our proposed multimodal method outperforms unimodal and other multimodal approaches by an average increase in F1-score of 0.25 and 0.09, respectively, on a data set with real patients. We also show that our method still outperforms other approaches by an average increase in F1-score of 0.17 when a modality is missing during inference, demonstrating the benefit of training on multiple modalities
    • …
    corecore